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(54) PIGMENT PROTEIN 

(57) An object of the present invention is to provide 
a novel chromoprotein derived from Cnidopus japon'h 
cus. The present invention provides a chromoprotein. 
derived from Cnidopus japonicus having the following 
properties: ^ 

(1) the absorption maximum wavelength is 610 nm, 
and fluorescence is not emitted; 



(2) the molar absorption coefficient is 66,700 at 61 0 
nm; and 

(3) the pH sensitivity of light-absorbing property is 
stable at between. pH 4 and pH 10. 
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Description 
TECHNICAL FIELD 

5 [0001] The present invention relates to a novel chromoproteln. More specifically, the present invention relates to a 
novel chromoproteln derived from Cnidopus japonicus, and the use thereof. 

BACKGROUND ART 

w [0002] Green fluorescent protein (GFP) derived from Aequorea victoria, a jellyfish, has many purposes in biological 
systems. Recently, various GFP mutants have been produced based on the random mutagenesis and semi-rational 
mutagenesis, wherein a color is changed, a folding property is improved, luminance is enhanced, or pH sensitivity is 
modified. Fluorescent proteins such as GFP are fused with other proteins by gene recombinant technique, and mon- 
itoring of the expression and transportation of the fusion proteins is carried out. 

15 [0003] One of the most commonly used types of GFP mutant is. Yellow fluorescent protein (YFP). Among Aequorea- 
derived GFP mutants, YFP exhibits the fluorescence with the longest wavelength. The values e and <I> of the majority 
. of YEPs are 60,000. to 100,000 M^cnr 1 and 0.6 to 0.8, respectively (Tsien, R. Y.(1998). Ann. Rev. Biochem. 67, 
509-544). These values are comparable to those of the general fluorescent group (fluorescein, rhodamine, etc.). Ac- 
cordingly, improvement of the absolute luminance of YFP is nearty approaching its limit. 

20 [0004] In addition, cyan fluorescent protein (CFP) is another example of the GFP mutant. Of this type of protein, 
ECFP (enhanced cyan fluorescent protein) has been known. Moreover, red fluorescent protein (RFP) has been isolated 
from sea anemone (Discoma sp.). Of this type of protein, DasRed has been known. Thus, 4 types of fluorescent 
proteins, that are, green fluorescent protein, yellow fluorescent protein, cyan fluorescent protein, and red fluorescent 
protein, have successively been developed. The range of the spectrum has significantly been expanded. 
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DISCLOSURE OF THE INVENTION 



[0005] An object of the present invention is to provide a novel chromoprotein derived from Cnidopus japonicus. 
[0006] The present inventors have conducted intensive studies directed towards achieving the aforementioned ob- 

30 ject. They have designed suitable primers based on information regarding the amino acid sequences of known fluo- 
rescent proteins. Using these primers, they have succeeded in the amplification and cloning of a gene encoding a 
novel chromoprotein from the cDN A library of Cnidopus japonicus exhibiting a green color. The present inventors have 
further analyzed the light-absorbing properties and pH sensitivity of the obtained chromoprotein derived from Cnidopus 
japonicus. The present invention has been completed based oh these findings. " 

35 [0007] That is to say, the present invention provides a chromoprotein derived from Cnidopus japonicus having the 
following properties: 

(1) the absorption maximum wavelength is 610 nm, and fluorescence is not emitted; 

(2) the molar absorption coefficient is 66,700 at 610 nm; and 

40 (3) the pH sensitivity of light-absorbing property is stable at between pH 4 and pH 1 0. 

[0008] In another aspect, the present invention provides a chromoprotein having either one of the following amino 
acid sequences: 

45 (a) the amino acid sequence shown in SEQ ID NO: 1 ; and 

(b) an amino acid sequence comprising a deletion, substitution and/or addition of one or several amino acids with 
respect to the amino acid sequence shown in SEQ ID NO: 1 , and having light-absorbing properties. 

[0009] In another aspect, the present invention provides a chromoprotein capable of emitting fluorescence, which 
so has an amino acid sequence, wherein, with respect to the amino acid sequence shown in SEQ ID NO: 1 , alanine as. 
an amino acid residue at position 28 is substituted by glycine, glutamic acid as an amino acid residue at position 41 is 
substituted by methionine, cysteine as an amino acid residue at position 145 is substituted by serine, and threonine 
as an amino acid residue at position 158 is substituted by isoleuctne. - 
[0010] In another aspect, the present invention provides: 



a chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is 

substituted by leucine with respect to the amino acid sequence shown in SEQ ID NO: 1 ; 

a chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is 
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substituted by methionine with respect to the amino acid sequence shown in SEQ ID NO: 1 ; 
a chromoprotein having an amino acid sequence, wherein glutamic acid as an amino acid. residue at position 41 
is substituted by leucine, and phenylalanine as an amino acid residue at position 80 is substituted by glycine, with 
respect to the amino acid sequence shown in SEQ ID NO: 1; 
s a chromoprotein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an 

amino acid residue at position 64 is substituted by phenylalanine with respect to the amino acid sequence shown 
in SEQ ID NO: 1; 

a chromoprotein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an 
amino acid residue at position 64 is substituted by histidine with respect to the amino acid sequence shown in 

10 SEQ ID NO: 1;and 

a chromoprotein capable of emitting fluorescence, which has an amino acid sequence, wherein cysteine as an 
amino acid residue at position 26 is substituted by valine, cysteine as an amino acid residue at position 143 is 
substituted by serine, and proline as an amino acid residue at position 199 is substituted by leucine, with respect 
to the amino acid sequence shown in SEQ ID NO: 1 . ... 

15 • - ' 

[0011] In another aspect, the present invention provides DNA encoding the protein of the present invention. 
[001 2] In another aspect, the present invention provides either one of the following DNAs: 

(a) DNA encoding the amino acid sequence shown in SEQ ID NO: 1 ; and 
20 (b) DNA encoding an amino acid sequence, which comprises- a deletion, . substitution and/or addition of one or 

several amino acids with respect to the amino acid sequence shown in SEQ ID NO:. 1 , and has light-absorbing 
properties. 

[0013] In another aspect, the present invention provides DNA having either one of the following nucleotide sequenc- 
es . es: 

(a) . the nucleotide sequence shown, in SEQ ID NO: 2; and ... 

(b) a nucleotide sequence comprising a deletion, substitution and/or addition of one or several nucleotides with 
respect to the nucleotide sequence shown in SEQ ID NO: 2, and encoding a protein having light-absorbing prop- 

30 erties. 

[001 4] In another aspect, the present invention provides DNA having the nucleotide sequence shown in any one of 
SEQ ID NOS: 12, 14, 16, 18, 20, or 22. 

[001 5] In another aspect, the present invention provides a recombinant vector.having the DNA of the present inven- 
ts tion. • 

[0016] In another aspect, the present invention provides a transformant having the DNA or recombinant vector of 
the present invention. 

[0017] In another aspect, the present invention provides a fusion protein composed of the chromoprotein of the. 
present invention and another protein. 

[0018] In another aspect, the present invention provides a method for analyzing a physiologically active substance, 
which is characterized in that the FRET (fluorescence resonance energy transfer) method is applied using the chromo- 
protein of the present invention as an acceptor protein. 

[0019] In another aspect, the present invention provides a light-absorbing reagent kit comprising the chromoprotein, 
DNA, recombinant vector, transformapt, or fusion protein of the present invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS .. 
[0020] . 

Figure 1 shows the results obtained by measuring the absorption spectrum of the chromoprotein (KGr) derived 
from Cnidopusjaponicus of the present invention. The transverse axis indicates the wavelength of absorbed light, 
and the vertical axis indicates absorbance. ' 
Figure 2 shows the pH sensitivity of the absorption spectrum of the chromoprotein (KGr) derived from Cnidopus 
japonicus of the present invention. The transverse axis indicates pH value, and the vertical axis indicates absorb- 
ance. 61 0 nm indicates the absorbance that is specific to the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention, and 277 nm indicates .the absorbance (light absorption by aromatic amino acids) that is 
. generally used as a quantitative amount of "protein. In other words, it is shown that the amount of protein is constant 
at 277 nm, and the absorbance at 610 nm that is specific to. the chromoprotein (KGr) derived from Cnidopus 
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japonicus of the present invention hardly changes in the range between pH 4 and pH 1 0. 
Figure 3 shows the fluorescence spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention (a mutant obtained.by substituting A at position 28 by G, substituting E at position 41 by 
M, substituting C at position 145 by S, and substituting T at position 158 by I in the amino acid sequence of KGr). 
s The transverse axis indicates wavelength,, and the vertical, axis indicates fluorescence intensity. The term "em" 

indicates a fluorescence spectrum, and the term "ex" indicates an excitation spectrum. 

Figure 4 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention (a mutant obtained by substituting Y at position 64 by L in the amino acid sequence of KGr). 
Figure 5 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
io of the present invention (a mutant obtained by substituting Y at position 64 by M in the amino acid sequence of KGr). 

Figure 6 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention (a mutant obtained by substituting Eat position 41 by L and substituting F at position 80 
by G in the amino acid sequence of KGr). 

Figure 7 shows the fluorescence and excitation spectrums of a mutant of the chromoprotein (KGr) derived from 
15 Cnidopus japonicus of the present invention (a mutant obtained by substituting Y at position 64 by F in the amino 

acid sequence of KGr). . . 

Figure 8 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 

of the present invention (a mutant obtained by substituting Y at position 64 by F in the amino acid sequence of KGr). 

Figure 9 shows the fluorescence and excitation spectrums of a mutant of the chromoprotein (KGr) derived from 
20 Cnidopus japonicus of the present invention (a mutant obtained by substituting Y at position 64 by H in the amino 

acid sequence of KGr). 

■ • . Figure 1 0 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
. of the present invention (a mutant obtained by substituting Y at position 64 by H in the amino acid sequence of KGr). 
Figure 11 shows the fluorescence and excitation spectrums of a mutant of the chromoprotein (KGr) derived from 
25 Cnidopus japonicus of the present invention (a mutant obtained by substituting C at position 26 by V, substituting 

C at position 1 43 by S, and substituting P at position 1 99 by L in the amino acid sequence of KGr). . 
Figure 1 2 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention (a mutant obtained by substituting C at position 26 by V, substituting C at position 143 by 
S, and substituting P at position 199 by. L in the amino acid sequence of KGr). 

30 

BEST MODE FOR CARRYING OUT THE -INVENTION 



[0021] The embodiments of the present invention will be described in detail below^ 
3 s (1) Chromoprotein of the present invention 

[0022] The chromoprotein of the present invention is characterized in that it is derived from Cnidopus japonicus, and 
has the following properties: 

40 (1 ) the absorption maximum wavelength is 610 nm, and fluorescence is not emitted; 

(2) the molar absorption coefficient is 66,700 at 610 nm; and 

(3) the pH sensitivity of light-absorbing property is stable at between pH 4 and pH 10. 

[0023] Cnidopus japonicus is one type of sea anemone belonging to Anthozoa of Cnidaria. Among the types of sea 
45 anemone that can be seen in Japan, Cnidopus japonicus has the highest degree of color mutation. Its withers height 
is always low, and it has a large number of warts on the body wall thereof. It has approximately 200 short tentacles. A 
parent sea anemone discharges developed embryos from its oral part. The discharged embryos become attached to 
the body wall of the parent sea anemone. Thereafter, the embryos are further developed, and as a result, they become 
baby sea anemone. Thus, this sea anemone was named Komochilsoginchaku (a Japanese name meaning "seed sea 
50 anemone"). This type of sea anemone is distributed, in the intertidal zones on the rock coasts. between Hokkaido and 
Boso Peninsula and also in the zones immediately below the intertidal zones. 

[0024] It is to be noted that a chromoprotein having the aforementioned properties was isolated using Cnidopus 
japonicus as a starting material in the examples described later, but that the chromoprotein of the present invention 
may also be obtained from forms of sea anemone other than Cnidopus japonicus in some cases. Such a chromoprotein 
55 is also included in the scope of the present invention. 

[0025] As described in the examples below, the chromoprotein of the present invention has an absorption maximum 
wavelength of 610 nm and does not emit fluorescence. In addition, the present chromoprotein has a molar absorption 
coefficient of 66,700 at 61 0 nm. The molar absorption coefficient represents the amount of absorbed photons per mole 
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of molecule. Since the chromoprotein of the present invention does not emit fluorescence, the chromoprotein of the 
present invention can be used: (1) as an acceptor molecule (energy receptor) in FRET; (2) in development of a system 
for converting the energy of applied light into energy other than the light; and (3) in introduction of a mutation into the 
ammo acid sequence of the protein to modify it so that it emits fluorescence. 

[0026] In addition, the chromoprotein of the present invention is characterized in that the pH sensitivity of light- 
absorbing properties is stable at between pH 4 and pH 1 0. That is to say, in the case of the chromoprotein of the present 
invention, the peak value of the absorption spectrum does not significantly fluctuate in the range between pH 4 and 
pH 10. Accordingly, even under the same conditions, the chromoprotein of the present invention can be used in abroad 
range of pH environments, and thus, the use of the chromoprotein in vivo has few restrictions. 
[0027] The examples of the chromoprotein of the present invention include a chromoprotein having either one of the 
following amino acid sequences: 

(a) the amino acid sequence shown in SEQ ID NO: 1 ; and 

(b) an amino acid sequence comprising a deletion, substitution and/or addition of one or several amino acids with 
respect to the amino acid sequence shown in SEQ ID NO: 1 : and having light-absorbing properties. 

[0028] The scope of "one or several" in the phrase "an amino acid sequence comprising a deletion, substitution and/ 
or addition of one or several amino acids" is not particularly limited in the present specification. For example, it means 
1 to 20, preferably 1 to 10, more preferably 1 to 7, further preferably 1 to 5, and particularly preferably 1 to 3. 
[0029] The term -light-absorbing properties" is used in the present specification to mean properties capable of ab- 
sorbing light having a certain wavelength. For example, an absorption maximum wavelength may be 610 nm as in the 
case of the chromoprotein described in the present specification , or the value of the absorption maximum wavelength 
may also be shifted. It is preferable that the pH sensitivity of light-absorbing properties is stable at between pH 4 and 
pH 10. . . . 

25 [0030] The chromoprotein of the present invention having the amino acid sequence shown in SEQ ID NO: 1 intte 
sequence listing does not emit fluorescence. In the present invention, one or several amino acids are deleted substi- 
tuted, and/or added with respect to the amino acid sequence shown in SEQ ID NO: 1 , so as to produce a protein having 
modified light-absorbing properties, or so as to produce a protein emitting fluorescence in some cases. The thus pro- 
duced proteins are also inciuded.in the scope of the present invention. 
30 [0031 1] A specific example of a fluorescent protein produced by such mutatbn of amino acids may be a fluorescent 
protein having an amino acid sequence, wherein, with respect to the amino acid sequence shown in SEQ ID NO" 1 
alanine as an amino acid residue at position 28 is substituted by glycine, glutamic acid as an amino acid residue at 
position 41 is substituted. by methionine s cysteine as an amino acid residue at position 145 is substituted by serine, 
and threonine as an amino acid residue at position 158 is substituted by isoleucine. 
35 ■ [0032] Other specific examples of a fluorescent protein produced by such mutation of amino acids may include: a 
chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is substituted 
by leucine with respect to the amino acid sequence shown in SEQ ID NO: 1 ; a chromoprotein having an amino acid 
sequence wherein tyrosine as an amino acid residue at position 64 is substituted by methionine with respect to the 
amino acid sequence shown in SEQ ID NO: 1; a chromoprotein having an amino acid sequence, wherein glutamic - 
acid as.an amino acid residue at position 41Js substituted by leucine, and phenylalanine as an amino acid residue at 
position 80 is substituted by glycine, with respect to the amino acid sequence shown in SEQ ID NO: 1 ; a chromoprotein 
capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine'as an amino acid residue at. 
position 64 is substituted by phenylalanine with respect to the amino acid sequence shown in SEQ ID NO: 1; a chromo- 
protein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an amino acid residue 
at position 64 is substituted by histidine with respect to the amino acid sequence shown in SEQ ID NO- 1" and a 
chrorrioprotein capable of emitting fluorescence, which has, an amino acid sequence, wherein cysteine as art amino 
acid residue at position 26 is substituted by valine, cysteine as an amino acid residue at position 1 43 is substituted by 
senne, and proline as an amino acid residue at position 199 is substituted by leucine, with respect to the amino acid 
sequence shown in SEQ ID NO: 1. 

[0033] The method of obtaining the chromoprotein of the present invention is not particularly limited. The protein 
may be either a protein synthesized by chemosynthesis, or recombinant protein produced by a gene recombination 
technique. 

[0034] Where a recombinant protein is produced, it is necessary to obtain DN A encoding the protein. Appropriate 
pnmers are designed by using information regarding the amino acid sequence shown in SEQ ID NO: 1 of the sequence 
listing of the present specification and the nucleotide sequence shown in SEQ ID NO: 2 thereof. Using these primers, 
PCR is carried out by using cDNA library derived from Cntddpus japonicus as a template, so that DNA encoding the 
chromoprotein of the present invention can be obtained. The chromoprotein of the present invention ban be produced 
by introducing this DNA into an appropriate expression system. Expression in an expression system will be described 
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later in the present specification. 

(2) DNA of the present invention 

5 [0035] According to the present invention, a gene encoding the chromoprotein of the present invention is provided. 
[0036] Specific examples of DNA encoding the chromoprotein of the present invention may include either one of the 
following DNAs: 

(a) DNA encoding the amino acid sequence shown in SEQ ID NO: 1 ; and 
10 (b) DNA encoding an amino acid sequence, which comprises a deletion, substitution and/or addition of one or 

several amino acids with respect to the amino acid sequence shown in SEQ ID NO: 1 , and has light-absorbing 
properties. 

[0037] Other examples of DNA encoding the chromoprotein of the present invention may include either one of the 
IS following DNAs: 

(a) the nucleotide sequence shown in SEQ ID NO: 2; and 

(b) a nucleotide sequence comprising a deletion, substitution and/or addition of one or several nucleotides with 
respect to the nucleotide sequence shown in SEQ ID NO: 2, and encoding a protein having light-absorbing prop- 

. 20 erties. 

' [0038] Further, examples of the DNA having the nucleotide sequence comprising a deletion, substitution and/or 
addition of one or several nucleotides with respect to the nucleotide sequence shown in SEQ ID NO: 2, and encoding 
a protein having light-absorbing properties, may include DNAhaving the nucleotide sequence shown in any one of 
.25 SEQ ID NOS: 12, 14, 16, 18, 20, or 22. 

[0039] The DNA of the present invention can be synthesized by, for example, the phosphoamidite method, or it can 
also be produced by polymerase chain reaction (PCR) using specific primers. The DNA of the present invention is 
. produced by the method described above in the specification. 

[0040] A method of introducing a desired mutation into a certain nucleic acid sequence is known to a person skilled 
. 30 in the art. For example, known techniques such as a site-directed mutagenesis, PCR using degenerated oligonucle- 
otides, or the exposure of cells containing nucleic acid to mutagens or radioactive rays, are appropriately used, so as 
to construct DNA having a mutation. Such known techniques are described in, for example, Molecular Cloning: A 
Laboratory Manual, 2 nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY., 1 989; and Current Protocols in 
Molecular Biology, Supplements 1 to 38, John Wiley & Sons (1987-1997). 
35 * 

, (3) Recombinant vector of the present invention 

[0041] The DNA of the present invention can be inserted into a suitable vector and used. The type of a vector used 
in the present invention is not particularly limited. For example, it may be either a vector that can autonomously replicate 
#> (e.g., a plasmid, etc.), or vector that is incorporated into the genomes of host cells when it is introduced into the host 
cells and is then replicated together with the chromosome into which it is incorporated. 

[0042] The vector used in the present invention is preferably an expression vector. In an expression vector; elements 
necessary for transcription (e.g., a promoter, etc.) are functionally ligated to the DNA of the present invention. The 
promoter is a DNA sequence which shows a transcriptional activity in host ceils, and it is appropriately selected de- 

45 pending on the type of host cells. 

[0043] Examples of a promoter which can operate in bacterial cells may include a Bacillus stearothermophilus mal- 
togenic amylase gene promoter, a Bacillus lichen! formis alpha-amylase gene promoter, a Bacillus amyloliquefaciens 
BAN amylase gene promoter, a Bacillus subtilis alkaline protease gene promoter, a Bacillus pumilus xylosidase gene 
promoter, P R and P L promoters of phage rhamda, and lac, trp and tac promoters of 

so ■ 

Escherichia coU. 

[0044] Examples of a promoter which can operate in mammalian cells may include an SV40 promoter, an MT-1 
(metailothionein gene) promoter, and an adenovirus-2 major late promoter. Examples of a promoter which can operate 
» in insect cells may include a polyhedrin promoter, a P 1 0 promoter, an Autographa califomica polyhedrosis basic protein 
. promoter, a bacuibvirus immediate-early gene 1 promoter, and a baculovirus 39K delayed-earty gene promoter. Ex- 
amples of a promoter which can be operate in yeast host cells may include promoters derived from yeast glycolytic 
genes, an alcohol dehydrogenase gene promoter a TPI1 promoter, and an ADH2-4c promoter. 
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[0045] Examples of a promoter which can operate in filamentous cells may include an ADH3 promoter and a tpiA 
promoter. . 

[0046] In addition, an appropriate terminator such as a human growth hormone terminator, or a TPI1 terminator or 
ADH3 terminator for fungal cells, may be functionally bound to the DNA of the present invention, as necessary. The 
recombinant vector of the present invention may further have elements such as a polyadenylation signal (e.g., one 
derived from SV40 or the adenovirus 5E1 b region), a transcription enhancer sequence (e.g., an SV40 enhancer), or 
a translation enhancer sequence (e.g., one encoding the adenovirus VA RNA). 

[0047] The recombinant vector of the present invention may further comprise a DNA sequence which enables the 
replication of the recombinant vector in host cells. SV40 replication origin is an example of such a sequence (when 
the host cells are mammalian cells). 

[0048] The recombinant vector of the present invention may further comprise a selective marker. Examples of such 
a selective marker may include genes, complements of which are absent from host cells, such as a dihydrofolate 
reductase (DHFR) gene or a Shizosaccharomyces pqmbe TPI gene, and drug resistant genes such as ampicillin, 
kanamycin, tetracycline, chloramphenicol, neomycin or hygromycin-resistant genes. 

[0049] A method for ligating the DNA of the present invention, a promoter and, as desired, a terminator and/or a 
secretory signal sequence to one another and inserting these items into a suitable vector is known to a person skilled 
in the art. 

(4) Transformant of the present invention 

[0050] A transformant can be produced by introducing the DNA or recombinant vector of the present invention into 
a suitable host. 

[0051] Any cell can be used as a host cell into, which the DNA or recombinant vector of the present invention is 
. introduced, as long as.the DNA construct of the present invention can be expressed therein. Examples of such a cell 
may include bacteria, yeasts, fungal cells, and higher eukaryotic cells. 

[0052] Examples of bacteria may include Gram-positive bacteria such as Bacillus orStreptomyces, and Gram-neg- 
ative bacteria such as Escherichia coli. These bacteria may be transformed by the protoplast method or other known 
methods, using competent cells. 

[0053] Examples of mammalian cells may include HEK 293 cells, HeLa cells, COS cells, BHK celis, CHL cells, and 
CHO cells. A method of. transforming mammalian cells and expressing the introduced DNA sequence in the cells is 
also known. Examples of such a method may include the electroporation, the calcium phosphate method, and the 
lipofection method. 

[0054] Examples of yeast cells may include those belonging to Saccharomyces or Shizosaccharomyces. Examples 
of such cells may include Saccharomyces cerevisiae and Saccharomyces ktuyveri. Examples of a method of introducing 
a recombinant vector into yeast host cells may include the electroporation, the spheroplast method, and the lithium 
acetate method. 

[0055] Examples of other fungal cells may include those belonging to Filamentous fungi such as Aspergillus, Neu- 
rpspora, Fusarium or Trichoderma. Where Filamentous fungi are used as host cells, transformation can be carried out 
by incorporating DNA constructs into host chromosomes, so as to obtain recombinant host cells. Incorporation of DNA 
constructs into the host chromosomes is carried out by known methods, and such known methods may include ho- 
mologous recombination and heterologous recombination. 

[0056] Where insect cells are used as host cells, both a vector into which a recombinant gene is introduced and a 
baculovirus are co-introduced into insect cells, and a recombinant virus is obtained in the culture supernatant of the 
insect cells. Thereafter, insect cells are infected with the recombinant virus, so as to allow the cells to express proteins 
(described in, for example, Baculovirus Expression Vectors, A Laboratory Manual; and Current Protocols in Molecular 
Biology, Bio/Technology, 6, 47 (1988)). 

[0057] .The Auiographa califomica nuclear polyhidrosis virus, which is a virus. infecting to insects belonging to Bar- 
athra brassicae, can be used as baculovirus. 

[0058] Examples of insect cells used herein may. include Sf9 and Sf21, which are Spodoptera frugiperda ovarian 
cells [Bacu|ovirus Expression Vectors, A Laboratory Manual, W. H. Freeman & Company, New York, (1 $92)], and HiFive 
(manufactured by Invitrogen), which are Trichoplusia ni ovarian cells. 

[0059] . Examples of the method of co-introducing both a vector into which a recombinant gene has been introduced 
and the above baculovirus into insect cells to prepare a recombinant virus may include the.calcium phosphate method 
and the lipofection method. 

[0060] The above transformant is cultured in an appropriate nutritive medium under conditions enabling the intro- 
duced DNA construct to be expressed. In order to isolate and purify the protein of the present invention from the culture 
product of the transformant, common methods of isolating and purifying proteins may be used. 
[0061] For example, where the protein of the present invention is expressed in a state dissolved in cells, after com- 
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pletion of the culture, ceils are recovered by centrifugal separation, and the recovered cells are suspended in a water 
type buffer. Thereafter, the cells are disintegrated using an ultrasonic disintegrator or the like, so as to obtain a cell- 
free extract. A supernatant is obtained by centrifuging the cell-free extract, and then, a purified sample can be obtained 
from the supernatant by applying, singly or in combination, the following ordinary protein isolation and purification 

5 methods: the solvent extraction, the salting-out method using ammonium sulfate or the like, the desalting method, the 
precipitation method using an organic solvent, the anion exchange chromatography using resins such, as diethylami- 
noethyl (DEAE) sepharose, the cation exchange chromatography using resins such as S-Sepharose FF (manufactured 
by Pharmacia), the hydrophobic chromatography using resins such as butyl sepharose or phenyl sepharose, the gel 
filtration method using a molecular sieve, the affinity chromatography, the chromatofocusing method, and the eiectro- 

10 phoresis such as isoelectric focusing. 

(5) Use of the chromoprotein of the present invention and a fusion protein comprising the same 

[0062] The chromoprotein of the present invention can be fused with another protein, so as to construct a fusion 
15 protein. The type of said another protein to be fused to the chromoprotein of the present invention is not particularly 
limited, and preferred examples may include a protein which interacts with another molecule. The examples may include 
a receptor protein or ligand thereof, antigen, antibody and the like. 

[0063] A method of obtaining the fusion protein of the present invention is not particularly limited. It may be either a 
protein synthesized by chemosynthesis, or recombinant protein produced by a gene recombination technique. 

20 [0064] Where a recombinant fusion protein is produced, it is necessary to obtain DNA encoding the protein. The 
DNA encoding the chromoprotein of the present invention and the DNA encoding the another protein to be fused to 
the chromoprotein, can be obtained by the method as mentioned above in this specification or by the method similar 
to it; Then, these DNA fragments are ligated to one another by a gene recombination technique, so that DNA encoding 
the desired fusion protein can be obtained. This DNA is then introduced into an appropriate expression system, so that 

25 the fusion protein of the present invention can be produced: 

[0065] FRET (fluorescence resonance energy transfer) has been known as a means for analyzing the interaction 
between molecules. In FRET, for example, a first molecule labeled with a cyan fluorescent protein (CFP) acting as a 
first fluorescent protein is allowed to coexist with a second molecule labeled with a yellow fluorescent protein (YFP) 
acting as a second fluorescent protein 5 so as to allow the yellow fluorescent protein (YFP) to act as an acceptor molecule 

30. and to allow the cyan fluorescent protein (CFP) to act as a donor molecule. Thus, FRET (fluorescence resonance 
energy transfer) is allowed to take place between both molecules, so as to visualize the interaction between the first 
and second molecules. Namely, in FRET, different dyes are introduced into two types of molecules. One dyes with a 
higher energy level (a donor molecule) is selectively excited, and the fluorescence of the dye is measured. Long- 
. wavelength fluorescence from the other dye (an acceptor molecule) is also measured. The interaction between the 

35 molecules is visualized by using the difference between the amounts of both fluorescences. Only when both dyes are 
adjacent to each other due to the interaction of the two types of molecules, a decrease in the fluorescence of the donor 
molecule and an increase in the fluorescence of the acceptor molecule are observed by single wavelength excitation 
dual wavelength photometry. However, in a case where a chromoprotein is used as an acceptor molecule, a decrease 
in the fluorescence of the donor molecule occurs only when both dyes are adjacent to eacjh other by the interaction of 

40 the two types of molecules. Such a decrease can be observed by single wavelength excitation single wavelength 
photometry. Thus, the use of the chromoprotein of the present invention enables facilitation of measurement appara- 
tuses. 

[0066] The chromoprotein of the present invention is particularly advantageous when it is used as an acceptor molr 
ecule in FRET (fluorescence resonance energy transfer). That is to say, a fused form (a first fused form) of the chromo- 

45 protein of the present invention and a test substance is first produced. Then, a fused form (a second fused form) of 
another test substance interacting with the above test substance and another fluorescent protein is produced. There- 
after, the first fused form is allowed to interact with the second fused form, and the generated fluorescence is analyzed, 
so that the interaction between the aforementioned two types of test substances can be analyzed. FRET (fluorescence 
resonance energy transfer) using the chromoprotein of the present invention may be earned out either in a test tube 

50. or in a cell. 

(6) Kit of the present invention 

[0067] The present invention provides a light-absorbing reagent kit comprising at least one which is selected from 
55 the chromoprotein, fusion protein, DNA, recombinant vector or transformant, which are described in the present spec- 
ification. The kit of the present invention can be produced from commonly used materials that are known per se, by 
using common methods. 

[0068] Reagents such as the chromoprotein or the DNA are dissolved in an appropriate solvent, so that the reagents 
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can be prepared in a form suitable for conservation. Water, ethanol, various types of buffer solution, etc. can be used 
as such a solvent. 

[0069] The present invention will be further described in the following examples. However, the present invention is 
not limited by these examples. 

5 

EXAMPLES 

Example 1: Isolation of gene encoding novel chromoprotein from sea anemone 

10 (1 j Extraction of total RNA 

[0070] A chromoprotein gene was isolated from sea anemone emitting a green color. Cnidopus japonicus emitting 
. a green color was used as a material. Frozen Cnidopus japonicus was crushed in a mortar. 7.5 ml of TRIzol" (GIBCO 
BRL) was added to .1 g (wet weight) of Cnidopus japonicus, and the mixture was homogenized, followed by centrifu- 
15 . gation at 1 ,500 x g for 1 0 minutes. 1.5 ml of chloroform was added to the supernatant. The mixture was stirred for 15 
seconds and then left at rest for 3 minutes. The resultant product was centrifuged at 7,500 x g for 1 5 minutes. 3.75 ml 
of isopropanol was added to the supernatant. The mixture was stirred for 1 5 seconds and then left at rest fori 0 minutes. 
The resultant product was centrifuged at 17,000 x g for 1 0 minutes. The supernatant was discarded, and 6 ml of 70% 
. ethanol was added thereto. The obtained mixture was centrifuged at 1 7,000 x g for 10 minutes. The supernatant was 
.20 discarded, and the precipitate was dissolved in 200 uJ of DE PC water. Total RNA dissolved in the DEPC water was 
100 times diluted, and the values of O.D.260 and O.D.280 were measured, so as to determine the concentration of 
RNA. 1.2 mg of the total RNA was obtained from a green individual. ' 

(2) Synthesis of first stand cDN A 

[0071 ] cD N A (33 jil) was synthesized from 4 jig of the total RNA using a kit for synthesizing first strand cDN A, "Ready 
To Go" (Amersharn Pharmacia). 

. (3) Degenerated PCR 

30 ' *• ' - * ■ " * . 

[0072] Using 3 uJ but of the synthesized first strand cDN A (33 uJ) as a template, PCR was carried out. Primers were 
designed and produced by comparing the amino acid sequences of known fluorescent proteins, extracting similar 
. portions, and converting them into nucleotide sequences. The sequences of the used primers. are shown below: 

' •' 35 ■ '-' ■ 

5'-GGIGSICCIHTISCrrT-3' (primer i) (SEQ ID NO: 3); 

and • 
40 ' 

: 5'-AACTGGAAGAATTCGCGGCCGCAGAAll l ITT 1 11 1 1 1 ITTTTT-3' (primer 2) 

(SEQ ED NO: 4), 

45 

wherein I represents inosine, S represents C or G, and H represents A, T, or C. . 



so. . 



Composition of PCR reaction solution 


Template (first strand cONA) 


3 p.1 


X 10 taq buffer 


5uJ 


2.5mMdNTPs . .. 


4u.l 


1 00 uM primer 3 


iv\ 


1 00 uM primer 4 


1 jxl. 


MilliQ 


35 nl 


Taq polymerase (5 U/uJ) 


1HI 
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RCR reaction conditions 
94°C, 1 minute (PAD) 
94°C, 30. seconds (den atu ration) . 
52°C, 30 seconds (annealing of the primers to the template) 
. 5 72°C, 1 minute (elongation of the primers) 

[0073] 30 cycles consisting of the above 3 steps were carried out The annealing temperature was decreased 0.3°C 
per cycle. That is to say, the annealing temperature in the 30 th cycle was 43°C. 
72°C, 7 minutes (final elongation) 
10 Retention at 4°C 

[0074] Using 1 nl of an amplified product obtained as a result of the first PCR reaction as a template, PCR was 
. carried out once again under the same conditions. A 800-bp fragment (derived from the green individual) was cut out 
by agarose gel electrophoresis and then purified. This 800-bp fragment contained a 3'4JTR portion as a whole. 

*5 (4) Subcloning and sequencing 

[0075] the purified DNA fragment was ligated to a pT7-blue vector (Novagen). Escherichia colt (TG1) was trans- 
formed with the vector, and the obtained transformants were subjected to blue white selection. Thereafter, plasmid 
DNA was purified from white colonies of Escherichia coll The nucleotide sequence of the inserted DNA fragment was 
20 determined by a DNA sequencer. The obtained nucleotide sequence was compared with the nucleotide sequences of 
other fluorescent protein genes to confirm that the nucleotide sequence of the DNA was derived from a fluorescent 
protein. S'-RACE and 3'-RACE methods were applied to a gene that had been confirmed to be a part of a fluorescent 
protein gene, so as to carry out the cloning of a full-length gene. 

25 (5) S'-RACE method 

[0076] In order to determine the nucleotide sequence of the 5' -terminal side of the DNA fragment obtained by de- 
generated PCR, the 5'-RACE method was applied using S'-RACE System for Rapid Amplification of cDNA Ends, Ver- 
sion 2.0 (GIBCO BRL). 3 jig of the total RNA prepared in (1) above was used as a template. 
30 [0077] . For the first amplification of DC-tailed cDNA of the green individual* the following primers were used: 



5'-GGCCACGCGTCGACTAGTACGGGIIGGGIIGGGIIG-3' (primer 3) (SEQ ED NO: 

35 5); 

and \ • 

40 . 5'-AGACGAGGCAATTTCCAT CAAG-3' (primer 4) (SEQ ID NO: 6), 

wherein I represents inosine. 

[0078] For the second amplification, the following primers were used: 

45 . 

5'-GGCCACGCGTCGACTAGTAC-3' (primer 5) (SEQ ED NO: 7); 

and 

50. 

5'-GGCTACGCTTCCATATTGGCAGTT-3' (primer 6) (SEQ ED NO: 8). 

PCR reaction conditions and the like were determined in accordance with the protocols attached with the kit. 
[0079] The 350-bp amplified band was cut out by agarose gel electrophoresis and then purified. The purified DNA 
55 fragment was ligated to a pT7-blue vector (Novagen). Escherichia coli (TGI) was transformed with the vector, and the 
obtained transformants were subjected to blue white selection. Thereafter, plasmid DNA was purified from white col- 
onies of Escherichia coli. The nucleotide sequence of the inserted DNA fragment was determined by a DNA sequencer. 
The entire nucleotide sequence is shown in SEQ ID NO: 2, and the entire amino acid sequence is shown in SEQ ID 
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NO: 1. 



Example 2: Expression of protein In Escherichia colt 

[0080] Primers corresponding to the N- and C-termini of the protein were produced from the obtained full-length 
nucleotide sequence. PCR was carried out using the primers and the first strand cDNA prepared in (2) above as a 
template. The used primers are as follows: 



w 



5'-CGGGATCCGACCATGGCTTCC AAAATCAGC-3' (primer 7) (SEQ ID NO: 9); 



and 



15 



5'-CCGGAATTCTTAATTGTGACCAAGTTTAGATGGGCA-3 r (primer 8) (SEQ ID 
NO: 10). 



20 



25 



30 



35 



40 



45 



50 



55 



Composition of PCR reaction solution - 


Template (first strand cDNA) 


3 uJ 


X 1 0 pyrobest buffer 


5ull 


2.5mMdNTPs 


4uJ 


100 jiM primer 7 . 


1uJ 


100 pA/l primer 8 


1uJ 


MilliQ 


35 uJ 


Pyrobest polymerase (5 U/jil) 


1ui 



RCR reaction conditions 

94° C, 1 minute (PAD) 

94° C, 30 seconds (denatu ration) 

55°C, 30 seconds (annealing of the primers to the template) 
72°C, 1 minute (elongation of the primers) 

[0081] 30 cycles consisting of the above 3 steps were carried out. 
72°C, 7 minutes (final elongation) 
Retention at 4°C 

[0082] An amplified band of approximately 700 bp was cut out by agarose gel electrophoresis and then purified. The 
purified ONA fragment was subcloned into the Banti\-EcoR\ site of a pRSET vector (Invitrogen); and it was then 
allowed to express in Escherichia coli (JM109-DE3). Since the expressed protein was constructed such that His-tag 
was attached to the N-terminus thereof, it was purified with Ni-Agarose gel (QIAGEN). Purification was carried out in 
accordance with the attached protocols. , 

Example 3: Analysis of protein 

(1 ) Analysis of light-absorbing properties 

[0083] The light-absorbing properties of the protein expressed in Example 2 were analyzed. 
[0084] An absorption spectrum was measured using a 50 mM HEPES solution (pH 7.5) containing a 20 chromo- 
protein. A molar absorption coefficient was calculated from the peak value of this spectrum. In the case of the chromo- 
protein derived from the green individual (referred to as KGr), the absorption peak was observed at 610 nm, and no 
fluorescence was detected (Table 1, Figure 1). 
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Table 1 



Properties of chron 


loprotein (KGr) 


Absorption 
maximum 


Fluorescence 
maximum 


Molar absorption 
coefficient 


quantum yield 


pH sensitivity 


Number of amino 
acids 


610 nm 




66,700 (610 nm) 




Non 


232 



10 



15 



(1) Measurement of pH sensitivity 

The pH sensitivity of the protein expressed in Example 2 was analyzed. 

The absorption spectrum of the protein was measured in the following 100 mM buffer solution (Figure 2). 
The following buffer solutions were used for each pH: 

. pH 4 and 5: Acetate buffer 
pH 6: MES buffer 
pH7and8:HEPESbuffer 
pH 9 and .10: Glycine buffer 



20 



25 



30 



35 



40 



45 



50 



55 



The peak value did not significantly change at any pH. 
Example 4: Modification of KGr 

[0085] In KGr, A at position 28 was substituted by G, E at position 41 was substituted by M, C at position 1 45 was 
substituted by S, and T at position 158 was substituted by I, so that the KGr was modified to have an absorption peak 
at 444 nm and to emit yellow fluorescence with a peak at 534 nm (Rgure 3). 

Example 5: Modification of properties of KGr by. amino acid substitution 

[0086] Y at position 64 that is a chromopho re-forming amino acid (QYG) of KGr was substituted by L or M, so that 
the absorption peak became 418 nm and so that the absorption peak was shifted from the original absorption peak at 
61 0 nm to the shorter wavelength side (Figures 1 and 2). The amino acid sequence of the protein wherein Y at position 
64 was substituted by L is shown in SEQ ID NO: 11 , and the nucleotide sequence thereof is shown in SEQ ID NO: 12. 
The amino acid sequence of the protein wherein Y at position 64 was substituted fay M is shown in SEQ ID NO: 13, 
and the nucleotide sequence thereof is shown in SEQ ID NO: 14. 

[0087] E at position 41 was substituted by L, and F at position 80 was substituted by G, so that the absorption peak 
become 528 nm and so that the absorption peak was shifted from the original absorption peak at 61 0 nm to the shorter 
wavelength side (Figure 3). The amino acid sequence of this protein is shown in SEQ ID NO: 15, and the nucleotide 
sequence thereof is shown in SEQ ID NO: 16. 

[0088] Y at position 64 that is a chromophore-forming amino acid (QYG) was substituted by F, so that the absorption 
peak became 41 2 nm, and so that the absorption peak was shifted from the original absorption peak at 61 0 nm to the 
shorter wavelength side and the protein was further modified to emit fluorescence with a peak at 504 nm (Figures 4 
and 5). The amino acid sequence of this protein is shown in SEQ ID NO: 17, and the nucleotide sequence thereof is 
shown in SEQ ID NO: 18. 

[0O89] Y at position 64 that is a chromophore-forming amino acid (QYG) was substituted by H, so that the absorption 
peak became 41 8 nm, and so that the absorption peak was shifted from the original absorption peak at 61 0 nm to the 
shorter wavelength side and the protein was further modified to emit fluorescence with a peak at 520 nm (Figures 6 
and 7). The amino acid sequence of this protein is shown in SEQ ID NO: 19, and the nucleotide sequence thereof is 
shown in SEQ ID NO: 20. 

[0090] C at position 26 was substituted by V, C at position 143 was substituted by S, and P at position 199 was 
substituted by L, so that the absorption peak became 585 nm, and so that the absorption peak was shifted from the" 
original absorption peak at 61 0 nm to the shorter wavelength side and the protein was further modified to emit fluo- 
rescence with a peak at 625 nm. This fluorescent protein was defined as KGr Rb (Figures 8 and 9). The amino acid 
sequence of this protein is shown in SEQ ID NO: 21 , and the nucleotide sequence thereof is shown in SEQ ID NO: 22. 

INDUSTRIAL APPLICABILITY 

[0091 ] The present invention provides a novel chromoprbtein derived from Cnidopus japonicus. The chromoprotein 
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of the present invention has desired fluorescence properties and low pH sensitivity. Thus, it is useful for molecular 
biology analysis. 
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SEQUENCE LISTING 
<110> RIKEN 
<120> Chromo protein 
<130> A31347A 
<160> 22 

<210> 1 
<211> 232 
<212> PRT 

<213> Cnidopus japonicus 



<400> 1 

Met Ala Ser Lys lie Ser Asp Asn Val Arg He Lys Leu Tyr Met Glu 

1 5 10 IS 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn lie Lys Val Thr Lys Gly 

35 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Tyr 

50 55 60 

Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

85 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

Asn Cys Leu Val Tyr Asn lie Lys He Leu Gly Cys Asn Phe Pro Pro 
• 115 120 125 
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Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

His Phe Ser Asp His Arg Pro Glu He Yal Lys Val Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220* 

Cys Pro Ser Lys Leu Gly His Asn 
225 230 



<210> 2 

3 5 <211> 699 * 

<212> DNA 

<213> Cnidopus japonicus 

■ 40 ' " ' 

<400X 2 

. atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 
& Met Ala Ser Lys He Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

/:• . 1 . S 10 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 
Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 
20 25 . 30 . 

' aag. cca tac gag gga act caa atg gag aac ata aaa gtc ace aaa gga 144 
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Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn lie Lys Val Thr Lys Gly 

35 40 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa tat 192 
Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Tyr 

50 55 60 

gga age gta gee ata ace aag tat aca tea ggg att cca gac tac ttt 240 
is Gly Ser Yal Ala. He Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 

65 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 
Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 
85 90 95 

25 gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 
100 105 HO 

30 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 
Asn Cys Leu Val Tyr Asn lie Lys lie Leu Gly Cys Asn Phe Pro Pro 
35 115 120 125 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

40 

130 135 . . 140 

gag atg. cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca.tta atg 480 
45 Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 

145 150 155 160 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act 528 
Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 
165 170 175 , 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 



so 
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Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 
His Phe Ser Asp His Arg Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gec agg tac tgt caa aca 672 
15 Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Glh Thr 

210 215 220 

. tgc cca tct aaa ctt ggt cac aat taa 699 
Cys Pro Ser Lys Leu Gly His Asn 
225 230 

25 • - : ' ' ' • • . 

<210> 3 . 
<211> 17 
<212> DNA. 

<213> Artificial Sequence 
35 <220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 3 

.40 

t ggigsiccih tiscitt 17 

45 <210> 4 

<211> 44 



50 



55 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic DNA 
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<400> 4 

aactggaaga attcgcggcc gcagaatttt tttttttttt tttt 44 

<210> 5 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Synthetic DNA 
<40O> 5 

ggccacgcgt cgactagtac gggiigggii gggiig 36 



<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
35 <220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<40O> 6 

agacgaggca atttccatca ag 22 



45 <210> 7 

<211> 20 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
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. <400> 7 

ggccacgcgt cgactagtac 20 

<210> 8 --.'••< ,.• 
<211> 24 
<212> DNA 

i 5 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<4O0> 8 

ggctacgctt ccatattggc agtt 24 

25 • 

<210> 9 
- <211> 30 

30 

<212> DNA 

<213> Artificial Sequence 
35 <220> • 

<223> Description of Artificial Sequence: Synthetic DNA 
' ; <400> 9 

40 

/ cgggatccga ccatggcttc caaaatcagc 30 



45 



<210> 10 
<211> 36 
so <212> DNA 



55 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
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<400> 10 

ccggaattct taattgtgac caagtttaga tgggca 36 

<210> 11 
<211> 232. 
<212> PRT 
15 <213> Cnidopus japonicus 

<400> 11 

Met Ala Ser Lys lie Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

1 5 10 15 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 
25. 20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn lie Lys Val Thr Lys Gly 

35 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Leu 

50 55 60 

Gly Ser Val Ala He Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

85 ; 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp GLy 

100 105 110 

Asn Cys Leu Val Tyr Asn lie Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 144 

Glu Met Arg Tyr Thr. Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
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145 150 155 . 160 . 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

His Phe Ser Asp His Arg Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 200 , 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr '.' 

210 . 215 220 

Cys Pro Ser Lys Leu Gly His Asn 
225 '•: 230 

<210> 12 
<211> 699 . . 
<212> DNA 

<213> Cnidopus japonicus 
<400> 12 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 
Met Ala Ser Lys He Ser Asp Asn Val Arg He Lys Leu Tyr Met Glu 

1 5 - 10 . 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 
Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly. Glu Gly 

. 20 25 30 ' ' . 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 
Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr. Lys Gly 

35 40 45 

ggc cct ctg ccg ttctct ttt gat ate ttg acg cct aac tgc caa ctt 192 
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Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Leu 

50 55 60 

gga age gta gec ata acc aag tat aca tea ggg att cca gac tac ttt 240 
Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 
65 .70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 
Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 
Asn Cys Leu Yal Tyr Asn He Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Yal Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 144 

gag atg cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 16Q 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act 528 
Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
Thr Tyr Arg Ser. Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 



. EP 1 512 744 A1 

His Phe Ser- Asp His Arg Pro Glu lie Val Lys Val Ser Clu Asn Gly 

195 200' 205 

aca eta ttt gaa cag cac gaa agt tea gtg gee agg tac tgt caa aca 672 
Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 21.5 220 

tgc cca tct aaa ctt ggt cac aat taa 699 
Cys Pro Ser Lys Leu. Gly His Asn 
225 ' 230 

<210> 13 
<211> 232 
<212> PRT 

<213> Cnidopus japonicus 
<400> 13 

Met Ala Ser Lys lie Ser Asp Asn Yal Arg lie Lys Leu Tyr Met Glu 

1 5 10 15 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 . 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn lie Lys Val Thr Lys Gly 

35 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Met 

50 55 60 

Gly Ser Val Ala lie Thr Lys Tyr Thr . Ser Gly He Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr . Thr He Tyr 

85 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 
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.100 105 110 

Asn Cys Leu Yal Tyr Asn lie Lys lie Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 
180 185 190 

25 His Phe Ser Asp His Arg Pro Glu lie Val Lys Val Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

Cys Pro Ser Lys Leu Gly His Asn 
225 230 



to 
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<210> 14 
<2ll> 699 
<212> DNA 

<213> Cnidopus japonicus 
<400> 14 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 
Met Ala Ser Lys lie Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

1 . 5 10 . 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 
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Gly Thr Val Asn Asn His His Phe Met Gys Glu Ala Glu Gly Glu Gly 

20 25 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 
Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr Lys Gly 

35 40 45 ... . 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa atg 192 
Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Met 

50 55 60 

gga age gta gec ata ecc aag tat aca tea ggg att cca gac tac ttt 240 
Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 
65 . 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 
Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 .... 105 110 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384. 
Asn Cys Leu Val Tyr Asn He Lys lie Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Gys Cys 

130 135 140 

gag atg cgc tat aca cgt gat ggt gtg c.ta tgt ggc caa aca tta atg. 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 .150 155 160 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act 528 



EP 1 512 744 A1 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 
His Phe Ser Asp His Arg Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 .200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gec agg tac tgt caa aca 672 
Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

tgc cca tct aaa ctt ggt cac aat taa 699 
. Cys Pro Ser Lys Leu Gly His Asn 
225 230 

<210> 15 
<211> 232 
<212> PRT 

<213> Cnidopus japoniciis 

<400> 15 - 

Met Ala Ser Lys lie Ser Asp Asn Val Arg He Lys Leu Tyr Met Glu 

1 5 10 15 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Leu Asn He Lys Val Thr Lys Gly . 

35. 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Met 
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50 55 60 

Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Gly 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 .105 110 

Asn Cys Leu Val Tyr Asn lie Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 . 125 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu. Cys Gly Gin Thr Leu Met 
145 150 .155 . ;. 160 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 ' 185. 190 

His Phe Ser Asp His Arg Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

Cys Pro Ser Lys Leu Gly His Asn 
225 230 

<210> 16 
<211> 699 
<212> DNA 
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<213> Cnidopus japonicus 
<400> 16 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg ctg 48 

Met Ala Ser Lys lie Ser Asp Asn .Val Arg lie Lys Leu Tyr Met Glu 

1 5 10 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96, 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

aag cca tac gag gga act caa atg ctt aac ata aaa gtc ace aaa gga 144 

Lys Pro Tyr Glu Gly Thr Gin Met Leu Asn lie Lys Val thr Lys Gly 

35 40 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa tat 192 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Met 

50 55 60 

gga age gta gee ata acc aag tat aca tea ggg att cca gac tac ggt 240 

Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Gly 

65 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 

Asn Cys Leu Val Tyr Asn lie Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc. 432 
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Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

gag atg cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act 528 
Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 
His Phe Ser Asp His Arg" Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gee agg tac tgt caa aca 672 
Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220. 

tgc cca tct aaa ctt ggt cac aat taa 699 
Cys Pro Ser Lys Leu Gly His Asn 

225 230 • 

<210> 17 
<211> 232 
<212> PUT 

<213> Cnidopus japonicus 
<400> 17 

Met Ala Ser Lys lie Ser Asp Asn Val Arg. He Lys Leu Tyr Met Glu 
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5 10 15 

Cly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met- Glu Asn lie Lys Val Thr Lys Gly 

35 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Phe 

50 55 60 

Gly Ser Val Ala He Thr Lys. Tyr Thr Ser Gly He Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

85 90 • 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

Asn Cys Leu Yal Tyr Asn He Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly. Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

His Phe Ser Asp His Arg Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 
210 215 "• 220 
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Cys Pro Ser Lys Leu Gly His Asn 
225 230 

<210> 18 
<211> 699 
<212> DNA 

<213> Cnidopus japonieus 
<400> 18 

atg get tec aaa ate age gae aat gta cgt ate aag tta tat atg gag 48 

Met Ala Ser Lys He Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

... 5 , , 10 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag gge 96 

Gly Thr Yal Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Yal Thr Lys Gly 

35 40 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa ttt 192 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Phe 

50 55 60 

gga age gta gee ata acc aag tat aca tea ggg att cca gae tac ttt 240 

Gly Ser Yal Ala lie Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 

65 70 x ; S: 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
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Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100. 105 110 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 
Asn Cys Leu Val Tyr Asn He Lys. He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt cct gtg atg cag aag aaa ace caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 
.130 . 135 140 

gag atg cgc tat aca cgt gat ggt. gtg eta tgt ggc caa aca tta atg 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act. 528 
Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 
His Phe Ser Asp His Arg Pro Glu lie Val Lys Val Ser Glu Asn Gly 

195 200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gee agg tac tgt caa aca 672 
Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

tgc cca tct aaa ctt ggt cac aat taa 699 
Cys Pro Ser Lys Leu Gly His Asn 
225 230 
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<2l6> 19 
<211> 232 
<212> PRT 

<213> Cnidopus japonicus 
<40O> 19 

Met Ala Ser Lys He Ser Asp Ash Val Arg lie Lys Leu Tyr Met Glu 

1 5 10 15 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn lie Lys YalThr Lys Gly 

35 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin His 

50 55 .60 

Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys "Leu Asp Gly 

100 105 110 

Asn Cys Leu Val Tyr Asn lie Lys He Leu Gly Cys Asn Phe Pro Pro 

115 . 120 . 125 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly. Asn His Leu Thr Cys His Leu Arg Thr 
.165 170 175 
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Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 '185 190 

His Phe Ser Asp His Arg Pro Glu He Val Lys Yal Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 
210 215 220 

is Cys Pro Ser Lys Leu Gly His Asn 

225 230 

20 

<210> 20 
<211> 699 
25 <212> DNA 

<213> Cnidopus japonicus 
<400> 20 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 
Met Ala Ser Lys He Ser Asp Asn Val Arg He Lys Leu Tyr Met Glu 

1 5 10 \ 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 
Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc ace aaa gga 144 
Lys Pro . Tyr Glu Gly Thr Gin Met Glu Asn He Lys Yal Thr Lys Gly 
35 40 45 

so ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa cat 192 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin His 

50 . 55 60 

gga age gta gee ata acc aag tat aca tea ggg at t cca gac tac ttt 240 
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Gly Ser Val Ala He Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 
65 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 
Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 HO 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 
Asn Cys Leu Val Tyr Asn lie Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 •-. 135 140 

gag atg cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

gca ctt aaa tgc gee gat ggg aac cac etc act tgc cat ctg aga act 528 
Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 ' 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 
His Phe Ser Asp His Arg Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 200 205 

aca eta. ttt gaa cag cac gaa agt tea gtg gee agg tac tgt caa aca 672 
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Thr Leu Phe Glu Gin His Clu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 . 220 

tgc cca tct aaa ctt ggt cac aat taa 699 
Cys Pro Ser Lys Leu Gly His Asn 
225 230 



is <210> 21 

<211) 232 



<212> PRT 
<213> Cnidopus japonicus 
<400> 21 . 
25 Met Ala Ser Lys lie Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

1 5 10 15 

Gly Thr Val Asn Asn His His Phe Met Val Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr Lys Gly 

35 ; 40 45 

Gly Pro Leu. Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Met 

50 55 60 

Gly Ser Yal Ala He Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 
85 90 95 

so Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 . 110 

Asn Cys Leu Yal Tyr Asn lie Lys He Leu Gly Cys Asn Phe Pro Pro 
115 120 125 
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Asn Cly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Ser Cys 

130 135 140 

Glu Met Arg Tyr. Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 ' 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Glh Met Pro Pro Phe ; 

180 185 190 

His Phe Ser Asp His Arg Leu Glu lie Val Lys Val Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

Cys Pro Ser Lys Leu Gly His Asn 
225 230 

<210> 22 

<211> 699 : 
<212> DNA 

<213> Cnidopus japonicus 
<400> 22 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 
Met Ala Ser Lys lie Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

1 5 10 15 

ggc aca gtc aac aat cat cac ttc atg gtc gaa get gaa gga gag ggc 96 
Gly Thr Val Asn Asn His His Phe Met Val Glu Ala Glu Gly Glu Gly 

20 25 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 
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Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr Lys Gly 

. 35 40" 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa tat 192 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Met 

50 55 60 

gga age gta gec ata acc aag tat aca tea ggg att cca gac tac ttt 240 

Gly Ser Yal Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 

65 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

85 90 95 

25 gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

30 ' 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 

Asn Cys Leu Val Tyr Asn He Lys lie Leu Gly Cys Asn Phe Pro Pro 
35 115 ' 120 125 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc agt tgc 432 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Ser Cys 

40 

130 135 140 

gag atg cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 

<5 Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 

145 150 . 155 160 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act 528 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
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Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt ctt gaa ata gtg aag gtt tea gag aac ggc 624 

His Phe Ser Asp His Arg Leu Glu He Val Lys Val Ser Glu'Asn.Gly 

195 200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gee agg tac tgt caa aca 672 

15 . Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

tgc cca. tct aaa ctt ggt cac aat taa. .699 

Cys Pro Ser Lys Leu Gly His Asn 
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Claims 

30 1. A chromoprotein derived from Cnidopus japonicus having the following properties: 

(1) the absorption maximum wavelength is 610 nm, and fluorescence is hot emitted; . 

(2) the moiar absorption coefficient is 66,700 at 610 nm; and . 

(3) the pH sensitivity of light-absorbing property is stable at between pH 4 and pH 10. 



2. A chromoprotein having either one of the following amino acid sequences: 



(a) the amino acid sequence shown in SEQ ID NO: 1 ; and 

(b) an amino acid sequence comprising a deletion, substitution and/or addition of one or several amino acids 
with respect to the amino acid sequence shown in SEQ ID NO: 1 , and having light-absorbing properties. 

3. A chromoprotein capable of emitting fluorescence,. which has an amino acid sequence, wherein, with respect to 
the amino acid sequence shown in SEQ ID NO: 1 , alanine as an amino acid residue at position 28 is substituted 
by glycine, glutamic acid as an amino acid residue at position 41 is substituted by methionine, cysteine as an 

45 amino acid residue at position 145 is substituted by serine, and threonine as an amino acid residue at position 158 

is substituted by isoleucine. . 

4. A chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is 
. substituted by leucine with respect to the amino acid sequence shown in SEQ ID NO: 1. 



5. A chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is 
substituted by methionine with respect to the amino acid sequence shown in SEQ ID NO: 1 . 

6. A chromoprotein having an amino acid sequence, wherein glutamic acid as an amino acid residue at position 41 
is substituted by leucine, and phenylalanine as an amino acid residue at position 80 is substituted by glycine, with 
respect to the amino acid sequence shown in SEQ ID NO: 1 . 

7. A chromoprotein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an 
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amino acid residue at position 64 is substituted by phenylalanine with respect to the amino acid sequence shown 
in SEQ ID NO: 1 . 

8. A chromoprotein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an 
amino acid residue at position 64 is substituted by histidine with respect to the amino acid sequence shown in 
SEQ ID NO: 1. 

9. A chromoprotein capable of emitting fluorescence, which has an amino acid sequence, wherein cysteine as an 
amino acid residue at position 26 is substituted by valine, cysteine as an amino acid residue at position 143 is 
substituted by serine, and proline as an amino acid residue at position 1 99 is substituted by leucine, with respect 
to the amino acid sequence shown in SEQ ID NO: .1. 



10. A DNA encoding the protein of any of claims 1 to 9. 
is 11. A DNA of either one of followings: 

(a) DNA encoding the amino acid sequence shown in SEQ ID NO: 1 ; and 

(b) DNA encoding an amino acid sequence, which comprises a deletion, substitution and/or addition of one 
.or several amino acids with respect to the amino acid sequence shown in SEQ ID NO: 1 , and has Hght-ab- 

20 sorbing properties. 

12. A DNA having either one of the following nucleotide sequences: 



(a) the nucleotide sequence shown in SEQ ID NO: 2; and 

(b) a nucleotide sequence comprising a deletion, substitution and/or addition of one or several nucleotides 
with respectto the nucleotide sequence shown in SEQ ID NO: 2, and encoding a protein having light-absorbing 
properties. . 

13. A DNA having the nucleotide sequence shown in any one of SEQ ID NOS: 12, 14, 16, 18, 20, or 22. 

14. A recombinant vector having the DNA of any of claims 10 to 13. 

15. A transformant having the DNA of any of claims 10 to 13 or the recombinant vector of claim 14^ 

as 16. A fusion protein composed of the chromoprotein of any of claims 1 to 9 and another protein. 

1 7. A method for analyzing a physiologically active substance, which is characterized in that the FRET (fluorescence 
resonance energy transfer) method is applied using the chromoprotein of any of claims 1 to 9 as an acceptor protein. 

40 18. A light-absorbing reagent kit comprising the chromoprotein of any of claims 1 to 9, the DNA of any of claims 10 to 
.13, the recombinant vector of claim 1 4, the transformant of daim 1 5, or the fusion protein of claim 16. 
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Fig. 7 
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